智能论文笔记

Artifact Identification in X-ray Diffraction Data using Machine Learning Methods

Howard Yanxon , James Weng , Hannah Parraga , Wenqian Xu , Uta Ruett , Nicholas Schwarz

分类：计算机视觉 | 机器学习

2022-07-29

研究人员高度利用了原位同步加速器高能X射线粉末衍射（XRD）技术，可以分析功能设备（例如电池材料）或复杂样品环境中材料的晶体结构反应堆）。材料的原子结构可以通过其衍射模式以及详细的分析（例如Rietveld的细化）来识别，该分析表明测量的结构如何偏离理想结构（例如内部应力或缺陷）。对于原位实验，通常在不同条件下（例如绝热条件）在同一样本上收集一系列XRD图像，产生不同的物质状态，或者简单地作为时间的时间连续收集，以跟踪样品的变化超过化学或物理过程。原位实验通常与区域探测器一起进行，收集由理想粉末的衍射环组成的2D图像。根据材料的形式，人们可能会观察到除现实样本及其环境的典型Debye Scherrer环以外的其他特征，例如纹理或优选方向以及2D XRD图像中的单晶衍射点。在这项工作中，我们介绍了对机器学习方法的研究，以快速可靠地识别XRD图像中的单晶衍射点。在XRD图像整合过程中排除伪影的排除允许精确分析感兴趣的粉末衍射环。我们观察到，当用高度多样的数据集对较小的子集进行训练时，梯度提升方法可以始终如一地产生高精度的结果。与常规方法相比，该方法大大减少了识别和分离单晶斑所花费的时间。

translated by 谷歌翻译

BIC: Twitter Bot Detection with Text-Graph Interaction and Semantic Consistency

Zhenyu Lei , Herun Wan , Wenqian Zhang , Shangbin Feng , Zilong Chen , Qinghua Zheng , Minnan Luo

分类：人工智能

2022-08-17

Twitter机器人检测是一项重要且有意义的任务。现有的基于文本的方法可以深入分析用户推文内容，从而实现高性能。但是，新颖的Twitter机器人通过窃取真正的用户的推文并用良性推文稀释恶意内容来逃避这些检测。这些新颖的机器人被认为以语义不一致的特征。此外，最近出现了利用Twitter图结构的方法，显示出巨大的竞争力。但是，几乎没有一种方法使文本和图形模式深入融合并进行了交互，以利用优势并了解两种方式的相对重要性。在本文中，我们提出了一个名为BIC的新型模型，该模型使文本和图形模式深入互动并检测到推文语义不一致。具体而言，BIC包含一个文本传播模块，一个图形传播模块，可分别在文本和图形结构上进行机器人检测，以及可证明有效的文本互动模块，以使两者相互作用。此外，BIC还包含一个语义一致性检测模块，以从推文中学习语义一致性信息。广泛的实验表明，我们的框架在全面的Twitter机器人基准上优于竞争基准。我们还证明了拟议的相互作用和语义一致性检测的有效性。

translated by 谷歌翻译

TwiBot-22: Towards Graph-Based Twitter Bot Detection

Shangbin Feng , Zhaoxuan Tan , Herun Wan , Ningnan Wang , Zilong Chen , Binchi Zhang , Qinghua Zheng , Wenqian Zhang , Zhenyu Lei , Shujie Yang

分类：人工智能

2022-06-09

Twitter机器人检测已成为打击错误信息，促进社交媒体节制并保持在线话语的完整性的越来越重要的任务。最先进的机器人检测方法通常利用Twitter网络的图形结构，在面对传统方法无法检测到的新型Twitter机器人时，它们表现出令人鼓舞的性能。但是，现有的Twitter机器人检测数据集很少是基于图形的，即使这些基于图形的数据集也遭受有限的数据集量表，不完整的图形结构以及低注释质量。实际上，缺乏解决这些问题的大规模基于图的Twitter机器人检测基准，严重阻碍了基于图形的机器人检测方法的开发和评估。在本文中，我们提出了Twibot-22，这是一个综合基于图的Twitter机器人检测基准，它显示了迄今为止最大的数据集，在Twitter网络上提供了多元化的实体和关系，并且与现有数据集相比具有更好的注释质量。此外，我们重新实施35代表性的Twitter机器人检测基线，并在包括Twibot-22在内的9个数据集上进行评估，以促进对模型性能和对研究进度的整体了解的公平比较。为了促进进一步的研究，我们将所有实施的代码和数据集巩固到Twibot-22评估框架中，研究人员可以在其中始终如一地评估新的模型和数据集。 Twibot-22 Twitter机器人检测基准和评估框架可在https://twibot22.github.io/上公开获得。

translated by 谷歌翻译

Unsupervised Approaches for Out-Of-Distribution Dermoscopic Lesion Detection

Max Torop , Sandesh Ghimire , Wenqian Liu , Dana H. Brooks , Octavia Camps , Milind Rajadhyaksha , Jennifer Dy , Kivanc Kose

分类：计算机视觉 | 机器学习

2021-11-08

有限的作品显示无监督的分布（OOD）方法对复杂的医疗数据的功效。在这里，我们展示了我们无监督的OOD检测算法，SIMCLR-LOF的初步调查结果，以及在医学图像上应用的最近现实方法（SSD）的最新状态。SIMCLR-LOF使用SIMCLR学习语义有意义的功能，如果测试样本是ood的，则使用LOF进行评分。我们在多源国际皮肤成像协作（ISIC）2019数据集上进行了评估，并显示与SSD竞争的结果以及应用于同一数据的最近监督方法。

translated by 谷歌翻译

Knowledge Graph Augmented Political Perspective Detection in News Media

Shangbin Feng , Zilong Chen , Wenqian Zhang , Qingyao Li , Qinghua Zheng , Xiaojun Chang , Minnan Luo

分类：自然语言处理 | 人工智能

2021-08-09

识别新闻媒体的政治观点已成为政治评论的快速增长和日益极化的政治意识形态的重要任务。以前的方法专注于文本内容，留出富裕的社会和政治背景，这在论证挖掘过程中至关重要。为了解决这一限制，我们提出了一种政治透视检测方法，包括外部域知识。具体而言，我们构建一个政治知识图形，以作为特定于域的外部知识。然后我们利用异质信息网络来代表新闻文件，共同模仿新闻文本和外部知识。最后，我们采用关系图神经网络，并作为图形级分类进行政治视角检测。广泛的实验表明，我们的方法始终如一地实现了两个现实世界的透视检测基准的最佳性能。消融研究进一步承担了外部知识的必要性以及我们基于图形的方法的有效性。

translated by 谷歌翻译

Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation

Yue Han , Jiangning Zhang , Zhucun Xue , Chao Xu , Xintian Shen , Yabiao Wang , Chengjie Wang , Yong Liu , Xiangtai Li

分类：计算机视觉

2023-01-03

Few Shot Instance Segmentation (FSIS) requires models to detect and segment novel classes with limited several support examples. In this work, we explore a simple yet unified solution for FSIS as well as its incremental variants, and introduce a new framework named Reference Twice (RefT) to fully explore the relationship between support/query features based on a Transformer-like framework. Our key insights are two folds: Firstly, with the aid of support masks, we can generate dynamic class centers more appropriately to re-weight query features. Secondly, we find that support object queries have already encoded key factors after base training. In this way, the query features can be enhanced twice from two aspects, i.e., feature-level and instance-level. In particular, we firstly design a mask-based dynamic weighting module to enhance support features and then propose to link object queries for better calibration via cross-attention. After the above steps, the novel classes can be improved significantly over our strong baseline. Additionally, our new framework can be easily extended to incremental FSIS with minor modification. When benchmarking results on the COCO dataset for FSIS, gFSIS, and iFSIS settings, our method achieves a competitive performance compared to existing approaches across different shots, e.g., we boost nAP by noticeable +8.2/+9.4 over the current state-of-the-art FSIS method for 10/30-shot. We further demonstrate the superiority of our approach on Few Shot Object Detection. Code and model will be available.

translated by 谷歌翻译

AI in HCI Design and User Experience

Wei Xu

分类：人工智能

2023-01-03

In this chapter, we review and discuss the transformation of AI technology in HCI/UX work and assess how AI technology will change how we do the work. We first discuss how AI can be used to enhance the result of user research and design evaluation. We then discuss how AI technology can be used to enhance HCI/UX design. Finally, we discuss how AI-enabled capabilities can improve UX when users interact with computing systems, applications, and services.

translated by 谷歌翻译

More is Better: A Database for Spontaneous Micro-Expression with High Frame Rates

Sirui Zhao , Huaying Tang , Xinglong Mao , Shifeng Liu , Hanqing Tao , Hao Wang , Tong Xu , Enhong Chen

分类：计算机视觉

2023-01-03

As one of the most important psychic stress reactions, micro-expressions (MEs), are spontaneous and transient facial expressions that can reveal the genuine emotions of human beings. Thus, recognizing MEs (MER) automatically is becoming increasingly crucial in the field of affective computing, and provides essential technical support in lie detection, psychological analysis and other areas. However, the lack of abundant ME data seriously restricts the development of cutting-edge data-driven MER models. Despite the recent efforts of several spontaneous ME datasets to alleviate this problem, it is still a tiny amount of work. To solve the problem of ME data hunger, we construct a dynamic spontaneous ME dataset with the largest current ME data scale, called DFME (Dynamic Facial Micro-expressions), which includes 7,526 well-labeled ME videos induced by 671 participants and annotated by more than 20 annotators throughout three years. Afterwards, we adopt four classical spatiotemporal feature learning models on DFME to perform MER experiments to objectively verify the validity of DFME dataset. In addition, we explore different solutions to the class imbalance and key-frame sequence sampling problems in dynamic MER respectively on DFME, so as to provide a valuable reference for future research. The comprehensive experimental results show that our DFME dataset can facilitate the research of automatic MER, and provide a new benchmark for MER. DFME will be published via https://mea-lab-421.github.io.

translated by 谷歌翻译

Surveillance Face Anti-spoofing

Hao Fang , Ajian Liu , Jun Wan , Sergio Escalera , Chenxu Zhao , Xu Zhang , Stan Z. Li , Zhen Lei

分类：计算机视觉

2023-01-03

Face Anti-spoofing (FAS) is essential to secure face recognition systems from various physical attacks. However, recent research generally focuses on short-distance applications (i.e., phone unlocking) while lacking consideration of long-distance scenes (i.e., surveillance security checks). In order to promote relevant research and fill this gap in the community, we collect a large-scale Surveillance High-Fidelity Mask (SuHiFiMask) dataset captured under 40 surveillance scenes, which has 101 subjects from different age groups with 232 3D attacks (high-fidelity masks), 200 2D attacks (posters, portraits, and screens), and 2 adversarial attacks. In this scene, low image resolution and noise interference are new challenges faced in surveillance FAS. Together with the SuHiFiMask dataset, we propose a Contrastive Quality-Invariance Learning (CQIL) network to alleviate the performance degradation caused by image quality from three aspects: (1) An Image Quality Variable module (IQV) is introduced to recover image information associated with discrimination by combining the super-resolution network. (2) Using generated sample pairs to simulate quality variance distributions to help contrastive learning strategies obtain robust feature representation under quality variation. (3) A Separate Quality Network (SQN) is designed to learn discriminative features independent of image quality. Finally, a large number of experiments verify the quality of the SuHiFiMask dataset and the superiority of the proposed CQIL.

translated by 谷歌翻译

Benchmarking the Robustness of LiDAR Semantic Segmentation Models

Xu Yan , Chaoda Zheng , Zhen Li , Shuguang Cui , Dengxin Dai

分类：计算机视觉

2023-01-03

When using LiDAR semantic segmentation models for safety-critical applications such as autonomous driving, it is essential to understand and improve their robustness with respect to a large range of LiDAR corruptions. In this paper, we aim to comprehensively analyze the robustness of LiDAR semantic segmentation models under various corruptions. To rigorously evaluate the robustness and generalizability of current approaches, we propose a new benchmark called SemanticKITTI-C, which features 16 out-of-domain LiDAR corruptions in three groups, namely adverse weather, measurement noise and cross-device discrepancy. Then, we systematically investigate 11 LiDAR semantic segmentation models, especially spanning different input representations (e.g., point clouds, voxels, projected images, and etc.), network architectures and training schemes. Through this study, we obtain two insights: 1) We find out that the input representation plays a crucial role in robustness. Specifically, under specific corruptions, different representations perform variously. 2) Although state-of-the-art methods on LiDAR semantic segmentation achieve promising results on clean data, they are less robust when dealing with noisy data. Finally, based on the above observations, we design a robust LiDAR segmentation model (RLSeg) which greatly boosts the robustness with simple but effective modifications. It is promising that our benchmark, comprehensive analysis, and observations can boost future research in robust LiDAR semantic segmentation for safety-critical applications.

translated by 谷歌翻译